Identifying Structure across Pre-partitioned Data

Authors

  • Zvika Marx
  • Ido Dagan
  • Eli Shamir
Abstract

We propose an information-theoretic clustering approach that incorporates a pre-known partition of the data, aiming to identify common clusters that cut across the given partition. In the standard clustering setting the formation of clusters is guided by a single source of feature information. The newly utilized pre-partition factor introduces an additional bias that counterbalances the impact of the features whenever they become correlated with this known partition. The resulting algorithmic framework was applied successfully to synthetic data, as well as to identifying text-based cross-religion correspondences.
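
The abstract does not spell out the algorithm, so the following is only an illustrative sketch and not the authors' method. It assumes that the correlation between a clustering and the known pre-partition can be quantified by the empirical mutual information between the two labelings: a value near zero suggests clusters that cut across the partition, while a high value suggests clusters that merely reproduce it. The function name `mutual_information` and the toy labelings are hypothetical.

```python
import math
from collections import Counter


def mutual_information(labels_a, labels_b):
    """Empirical mutual information, in bits, between two labelings."""
    if len(labels_a) != len(labels_b):
        raise ValueError("labelings must have the same length")
    n = len(labels_a)
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    count_ab = Counter(zip(labels_a, labels_b))
    mi = 0.0
    for (a, b), n_ab in count_ab.items():
        # p(a, b) * log2( p(a, b) / (p(a) * p(b)) ), written with raw counts
        mi += (n_ab / n) * math.log2(n_ab * n / (count_a[a] * count_b[b]))
    return mi


# Hypothetical toy data: eight items drawn from two pre-given subsets X and Y.
pre_partition = ["X", "X", "X", "X", "Y", "Y", "Y", "Y"]

# A clustering that simply reproduces the pre-partition.
aligned = [0, 0, 0, 0, 1, 1, 1, 1]

# A clustering whose clusters each draw equally from X and Y.
cross_cutting = [0, 0, 1, 1, 0, 0, 1, 1]

print(mutual_information(pre_partition, aligned))        # 1 bit: fully aligned
print(mutual_information(pre_partition, cross_cutting))  # 0 bits: cuts across
```

In this toy setting, a clustering procedure biased against the pre-partition would prefer the second labeling over the first whenever the features support both about equally well.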


Similar resources

Privacy Preserving Association Rule Mining in Vertically Partitioned Data

Data mining technology has emerged as a means for identifying patterns and trends from large quantities of data. This paper presents privacy preserving association rule mining across vertically partitioned data. We present an efficient algorithm to discover association rules with minimum levels of support and confidence, from heterogeneous data distributed across 2 parties, while preventing eit...


L2 Teachers’ Representations of Classroom Management Events: Variations across Experience Levels

Knowledge representation, defined as the way individuals structure their knowledge and cognitive processing of events and the associated sense-making processes, is believed to influence teachers’ reasoning/thinking skills. While extensively researched in mainstream teacher education, this line of inquiry is essentially lacking in the L2 teacher education literature. To fill some of the void, th...


The determinants of capital structure across firms’ sizes: The U.K evidence

This paper explores the leverage determinants across firms’ sizes based on the two main theories behind the capital structure, the trade-off and the pecking order theories. Panel data are used to find the relationship between capital structure and the variables that proxy for benefits and costs of debt during 1990 to 2006. Our findings show that both principles help to explain the capital structure...


Efficient Edge Noise Removal and Perceptual Feature Classification

Over-segmentation of edge features has been a challenging problem for many edge-based vision applications. Too many useless features are simply background noise which are costly for higher-level processing. The conventional methods of dealing with oversegmentation use various noise suppressing filters at pixel level for the entire image, and then form features by grouping identified edge points...


Adaptive Clutter Edge Estimation in Weibull Clutter Using the UMPI Pre-detector

In radar detection, the existence of the clutter edge in the reference samples considerably degrades the performance of the detector. Hence, clutter edge estimation not only improves the CFAR detectors, but also can be used for partitioning the various areas of the clutter in the clutter map. In this paper, we propose an adaptive algorithm for detecting the clutter edge between two Weibull clut...




Publication date: 2003